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The invention features a synthetic 
gene encoding a protein normally ex- 
pressed in a mammalian cell or eu- 
karyotic cell wherein at least one non- 
preferred or less preferred codon in the 
natural gene encoding the mammalian 
protein has been replaced by a preferred 
codon encoding the same amino acid. 



1 


CAATTCACGC 


CTAAGCTTGC 


CGCCACCATG 


CTGAGCAAGG 


GCCACGAGCT 


51 


GTTCACCCGC 


CTGCTGCCCA 


TCCTGGTCGA 


GCTGGACGGC 


GACGTGAACG 


101 


GCCACAAGTT 


CAGCCTCTCC 


GGCGAGGGCG 


AGGGCGATCC 


CACCTACGGC 


151 


AAGCTGACCC 


TGAAGTTCAT 


CTGCACCACC 


G6CAAGCTCC 


CCGTGCCCTG 


201 


GCCCACCCTC 


GTCACCACCT 


TCACCTACCG 


CCTCCACTGC 


TTCAGCOGCT 


251 


ACCCCCXCCA 


CATCAACCAG 


CAOGACTTCT 


TCAACTCCCC 


CATCCCCGAA 


301 


CCCTACCTCC 


AGGAGCGCAC 


CATCTTCTTC 


AAGGACGACG 


GCAACTACAA 


351 


cxcccccccc 


GAGCTGAAGT 


TCGAGCGCGA 


CACCCTGGTG 


AACCGCATCC 


401 


AGCTGAAGGG 


CATCCACTTC 


AAOGXGGACG 


GCAACATCCT 


GGGGCACAAG 


451 


CTGGAGTACA 


ACTACAACAG 


CCACAACGTC 


TATATCATCC 


CCCACAAGCA 


501 


CAAGAACGGC 


ATCAACGTGA 


ACTTCAACAT 


CCGCCACAAC 


ATCGACCACC 


551 


GCAGCGTGCA 


GCTCGCCCAC 


CACTACCACC 


AGAACACCCC 


GATCGGCGAC 


601 


GCCCCCGTGC 


TGCTGCCCGA 


CAACCACTAC 


CTGAGCACCC 


AGTCCGCCCT 


651 


GAGCAAAGAC 


CCCAACCACA 


AGCCCGATCA 


CATGGTCCTG 


CTGGAGTTCG 


701 


TCACCGCCGC 


CCGCATCACT 


CACGGCATGG 


ACGAGCTCTA 


CAAGTAAAGC 


751 


GGCCG CGGAT 


CC (SEQ ID NO: CO) 
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What is claimed is: 

1. A synthetic gene encoding a protein normally 
expressed in a eukaryotic cell wherein at least one non- 
preferred or less preferred codon in the natural gene 

5 encoding said protein has been replaced by a preferred 
codon encoding the same amino acid. 

2. The synthetic gene of claim l wherein said 
synthetic gene is capable of expressing said eukaryotic 
protein at a level which is at least 110% of that 

10 expressed by said natural gene in an in vitro mammalian 
cell culture system under identical conditions. 

3. The synthetic gene of claim 1 wherein said 
synthetic gene is capable of expressing said eukaryotic 
protein at a level which is at least 150% of that 

15 expressed by said natural gene in an in vitro cell 
culture system under identical conditions. 

4. The synthetic gene of claim 1 wherein said 
synthetic gene is capable of expressing said eukaryotic 
protein at a level which is at least 200% of that 

20 expressed by said natural gene in an in vitro cell 
culture system under identical conditions. 

5. The synthetic gene of claim 1 wherein said 
synthetic gene is capable of expressing said eukaryotic 
protein at a level which is at least 500% of that 

25 expressed by said natural gene in an in vitro cell 
culture system under identical conditions. 

6. The synthetic gene of claim 1 wherein said 
synthetic gene is capable of expressing said eukaryotic 
protein at a level which is at least ten times that 
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expressed by said natural gene in an in vitro cell 
culture system under identical conditions. 

7. The synthetic gene of claim 1 wherein at least 
10% of the codons in said natural gene are non-preferred 

5 codons . 

8. The synthetic gene of claim 8 wherein at least 
50% of the codons in said natural gene are non-preferred 
codons . 

9. The synthetic gene of claim 1 wherein at least 
10 50% of the non-preferred codons and less preferred codons 

present in said natural gene have been replaced by 
preferred codons. 

10. The synthetic gene of claim 1 wherein at 
least 90% of the non-preferred codons and less preferred 

15 codons present in said natural gene have been replaced by 
preferred codons. 

11. The synthetic gene of claim 1 wherein said 
protein is green fluorescent protein. 

12. A method for preparing a synthetic gene 
20 encoding a protein normally expressed by eukaryotic 

cells, comprising identifying non-preferred and less- 
preferred codons in the natural gene encoding said 
protein and replacing one or more of said non-preferred 
and less-preferred codons with a preferred codon encoding 
25 the same amino acid as the replaced codon. 
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I CTCGACATCC ATTC7GCTC? AAACCAGATA CCCGCCCACA CACCCTCACC 

5; 7GCGG7GCCC AGC7GCCCAC GCTGAGCCAA GAGAAGGCCA GAAACCAT/CC 

101 CCA7GGGGTC 7C7GCAACCG CTGGCCACCT TGTACCTTGCT GGOZATC-CZG 

151 C7CGC77CC3 TGCTACCCAC CCAGAAGCTC TGCC7CACCG 7GTACTACGC . 

201 CG7CCGCC7G TC-GAACCAGG CCACCACCAC CCTGTTCTGC GCCACCGACG 

251 CCAAGGCGTA CGACACCGAG G7CCACAACG TGTGCGCCAC CCAGGCG7CC 

301 G7GCCCACCG ACCCCAACCC CCAGGAGG7G GAGC7CG7GA ACG7GACCGA 

351 CAACTTCAAC ATG7GGAAGA ACAACATGG7 GGAGCAGA7G CATGAGGACA 

401 7CA7CAGCC7 GTGGGACCAG AGCC7GAACC CCTGCGTGAA GCTGACCCCC 

4 51 C7G7GCGTGA C Z77GAAC7G CACCGACC7G ACGAACACCA CCAACACCAA 

5C1 CAACAGCACC GZGAACAACA ACACCAACAG CGAGGGCACG A 7C AAGGGCG 

551 GCCAGA7GAA C^C7GCAGC TTCAACATCA CCACCAGCAT CCGCGACAAG 

601 ATGCAGAAGG- A37ACGCCC7 CC7C7ACAAG C7GGATATCG 7GAGCATCCA 

651 CAACGACACC AGCACCTACC GCC7GATC7C CTGCAACACC AGCGTGATCA 

7 CI CCCAGGCCTG qCCCAACATC AGCTTCGAGC CCA7CCCCAT CCACTACTCC 

751 GCCCCCGCCG C^TTCCCCAT CCTGAAGTGC AACGACAACA AC7TCAGCGG 

301 CAAGGGCAGC TGCAAGAACG TGAGCACCG7 GCAGTGCACC CACGGCATCC 

8:1 GGCCGG7GG7 (^AGCACCCAG C7CG7CC7GA ACGGCAGCCT GGGCGAGGAG 

9C1 GACG7CG7CA TCCGCAGCGA GAACTTCACC CACAACGCCA AGACCATCAT 

951 CG7GCACCTG AATGACAGCG 7CCAGATCAA C7GCACGGG7 CCCAACTACA 

1001 ACAAGCGCAA QCGCA7CCAC A7CCGCCCCG CGCCCGCG7T CTACACCACC 

12S i AAGAACATCA TCGGCACCAT CCGCCAGGCC CACTGCAACA TCTCTAGAGC 

1101 CAAGTGGAAC CACACGGTGC CCCAGA7CGT GAGCAAGC7C AAGGACCAGT 

1151 7CAAGAACAA GACCATCG7G 7TCAACCAGA GC^CCGCCGG CGACCCGGAC 

1201 A7CG7GATGC ACAGG77CAA C7GCGGCGGC GAAT7C7TC7 AC7CCAACAC 

12 51 CACGCCCCTG T7CAACAGCA CCTGGAACGG CAACAACACC 7GGAACAACA 
1301 CCACCCGCAC CAACAACAAT A7TACCC7CC AC7CCAAGA7 CAAGCAGATG 

13 5: A7CAACA7GT GGCAGGAGG7 GGCCAAGCCC A7C7ACGCCC CCCCCATCGA 
U01 GGGCCAGATC CGG7GCAGCA CCAACA7CAC CGC7C7CCTG C7CACCCCCG 

14 51 ACGGCGGCAA CCACACGGAC ACCAACCACA CCGAAA7CT7 GCGCCCCGGC 
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150 1 GCCGCCGACA TGCGCCACAA CTCGAGATCT 
15S1 GCTGACGATC Q\CCCCCTGG GCGTGGCCCC 
:6QI 7GG7GCAGCC CSAGAAGCGC TAAAGCGGCC 
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GAGCTCTACA AGTACAACCT 
CACCAAGGCC AACCGCCCCG 
GC (SEQ ID SO: 34) 



(suszr i of 4) 
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1 ACCCAGAJVCC 7^7CGG7GAC CT7G7AC7AC GGCCTGCCCS 7G7GGAAGGA 
:1 CGCCACCACC ACCCTC7TCT CC3CCAGCGA CGCCAAGGCG 7ACGACACCG 
IZl ACG7GCACAA CTTGTGCGCr ACCCACCCC? CCCTCCCCAC C3ACCCCAAC 
: c_ -crCACCACG T^GACCTr^T SAACCTfUrX GAGAAC77CA ACA7G7CGAA 
2C1 GAACAACA7C C7GGACCAGA 7GCATGAGGA CA7CATCAGC C7G7GGGACC 
: = i AGAGCC7GAA C^ZZZTC.r.CrC AAGCTGACCC CC77G73CG7 CACCC7CAAC 
3:i TGCACCGAr- Tr.AGGAACAC CACCAACACC AACAACACCA CCGCCAACAA 
;;1 CAACA^CAA.: AGCGAGGGCA CCA7CAAGGG CGGCGACA7G AAGAACTGCA 
121 r.~ CAACA7 CACrTACCAGC A7CC3CGACA AGA7CCAGAA CGA^ACCrC 
i*\ — SC7G7ACA A3C7CGA7A7 CG7CAGCA7C CACAACCACA UCACCACC7A 

ssi 

SO 1 A7C773AAC7 qCAACCAC-V. GAAC7TCAGC GGCAAGGGCA G77GCAAGAA 

55: CG7CACCAC7 : 7GCAGTGCA CC7ACGCCA7 CCCCCCGC7G C7GAGCACCC 

7C1 ACC7C77CC7 GAAwCCCACC C7CCCC3AGG ACCACGTC-'T CATCCr.CAGC 

7il CAGAACrrJA CCCACAACGC CAAGACZA7C A7CC7CCACr. TGAA7GAGAG 

3CI L*GTGCACA7C AAC7CCACGC C7GCCAACTA CAATAAGCGC AACCGCA7C3 

951 ACA7CGCCCC CGGGC3CGCC 77CTACACCA CCAAGAACA7 CA7CGGCACC 

901 A7CGGCCAGG C3CAC7GCAA CA7C77TAGA GCCAAG7GGA ACGACACC77 

3:1 CCCCCACA73 QTGACCAAT.r TCAAGGAGCA G77CAACAAC AACACCATCC 

122 1 7CT7CAAC.7A GAC.r.AGCCCC CGCGACCCCG AGA7CG7CA7 CCACACC77C 

10 31 AACTCr—rG GkGAA77C77 C7ACTGCAAC ACCACCCCC3 7G7TCAAC AG 

110*. CACCTGGAAC QGC AACAACA CC7GGAACA.V CACCACCJ'JGC AGCAAGAACA 

11*1 A7A77ACC37 CCAG7GCAAG A77AAGCACA 1CA7CAACA7 G7GGCAGCAG 

1201 G7GCGCAACG CCA7G7ACCC CC7C73CA71- CACCCCCAGA 7CCGC7GCAC 

12 51 CAGCAACA7C ACCGC7C7GC 7GC7GACCCG CZXCZCCZZZ AACV5ACACCC 

lid ACACC.V^-A CACCCAAATC 773CGCG7CG SCSCC-Wr-A CA7GCCCCAC 

1351 AAC7GGACA7 C7GAGC7G7A CAAG7ACAAr. G7GG7GACGA 7CGAGCCCC7 

113 1 CZZZZ'ZZZZ CCCACCAACG 7CA AHr.r.CCG CG7GG7GCAC CGCGAGAACC 
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14 51 GGGCCGCCA7 C^GCGCGCTG 7TCCTCGGCT 7CCTCCGCGC CGCGOGCAGC 
1501 ACCATGGCGG CCGCCACCGT GACCCTGACC GTGCAGGCCC CCC7CCTCCT 
1551 GAGCGGCA7C CtTGCAGCAGC AGAACAACC7 CC7CCSCGCC ATCCAGCCCC 
1601 ACCACCA7AT CCTCCAGCTC ACCGTGTGGC GCA7CAAGCA GCTCCAGGCC 
1551 CGCC7CC7GG CCGTGGAGCG CTACCTGAAG GACCAGCACC TCCTGGGCT7 
1701 CTGGCCC7CC 7CCCGCAAGC TSATCTGCAC CACIACGC7A QCC7GGAACG 
17 £1 CC7CC7GGAG CAACAACACC C7GGACGACA 7C7GGAACAA CA7GACC7GG 
15CL ATGCAC7GGG A.ICCCGAGA7 CGA7AAC7AC ACCAGCC7GA 7C7ACAGCC7 

15 51 GCTCGAGAAC ^ZZCXOkZZZ AGCAGGACAA GAACGACCAG GAGCTGCTGG 
IrCl ACCTCCACAJ* C"GGGCGACC C7C7GGAAC7 GGT7CGACA7 CACCAACTCG 
19 51 C7G7GG7ACA r:AAAA7C77 CA7CATGA77 G7GGGCGGCC 7GG7GGGCCT 



CCGCA77GTG T7CGCCG7G- 7CAGCATCG7 GAACCGCG7G ZZZZAGZZZZ 




2 , - , CGC7CCGAGG CCA7CGAGGA GGACGCCGGC GAGCGCGACC GCGACACCAG 

2 151 GGGCAGGC7C C7GCACGGC7 7CCTGGCGAT CATC7GGG7C GACC7CCGCA 

2201 GCC7C7TC77 q7TCAGC7AC CACCACCGCG AC77GCTGCT GA7CCCCGCC 

2251 CGCA7CC7GG AAC7CCTAGG CCGCCGCCGC 7GGGAGG7GC 7GAAGTACTG 

2301 G7GGAAC777 C7CCAGTA77 GGAGCCAGGA GC7CAAGTCC ACCGCCGTGA 

2351 CC-7GC7GAA CGCCACCCCC A7CCCCG7GG C7GACGGCAC CGACCGCG7G 

2 401 A7CGAGG7CC TCCAGAGGGC CGGGAGGGCG A7CC7GCACA 7CCC7ACCCG 

24 51 CATCCGCCAG J :GGCTCGAGA GGGCGC7GC7 G (SEQ ID NO: 35) 
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FIGURE 3 
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FICURE 9 
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1 


GAATTCACGC 


GTAAGCTTGC 


CGCCACCATG 


GTGAGCAAGG 


GCGAGGAGCT 


51 


GTTCACCGGG 


GTGGTGCCCA 


TCCTGGTCGA 


GCTGGACGGC 


GACGTGAACG 


101 


GCCACAAGTT 


CAGCGTGTCC 


GGCGAGGGCG 


AGGGCGATGC 


CACCTACGGC 


151 


AAGCTGACCC 


TGAAGTTCAT 


CTGCACCACC 


GGCAAGCTGC 


CCGTGCCCTG 


201 


GCCCACCCTC 


GTGACCACCT 


TCAGCTACGG 


CGTGCAGTGC 


TTCAGCCGCT 


251 


ACCCCGACCA 


CATGAAGCAG 


CACGACTTCT 


TCAAGTCCGC 


CATGCCCGAA 


301 


GGCTACGTCC 


AGGAGCGCAC 


CATCTTCTTC 


AAGGACGACG 


GCAACTACAA 


351 


GACCCGCGCC 


GAGGTGAAGT 


TCGAGGGCGA 


CACCCTGGTG 


AACCGCATCG 


Art! 


^GCTGAAGGG 


CATCGACTTC 


AAGGAGGACG 


GCAACATCCT 


GGGGCACAAG 


451 


CTGGAGTACA 


ACTACAACAG 


CCACAACGTC 


TATATCATGG 


CCGACAAGCA 


501 


GAAGAACGGC 


ATCAAGGTGA 


ACTTCAAGAT 


CCGCCACAAC 


ATCGAGGACG 


551 


GCAGCGTGCA 


GCTCGCCGAC 


CACTACCAGC 


AGAACACCCC 


CATCGGCGAC 


601 


GGCCCCGTGC 


TGCTGCCCGA 


CAACCACTAC 


CTGAGCACCC 


AGTCCGCCCT 


651 


GAG C AAAG AC 


CCCAACGAGA 


AGCGCGATCA 


CATGGTCCTG 


CTGGAGTTCG 


701 


TGACCGCCGC 


CGGGATCACT 


CACGGCATGG 


ACGAGCTGTA 


CAAGTAAAGC 


751 


GGCCGCGGAT 


CC (SEQ ID NO: 40) 
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